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Transient Expression of an LEDGF/p75 Chimera Retargets 
Lentivector Integration and Functionally Rescues in a 
IVIodel for X-CGD 

Sofia Vets\ Jan De Rijck\ Christian BrendeP, Manuel Grez^ Frederic Bushman^ Zeger Debyser^ and R\k Gijsbers^ 

Retrovirus-based vectors are commonly used as delivery vehicles to correct genetic diseases because of their ability to integrate 
new sequences stably. However, adverse events in which vector integration activates proto-oncogenes, leading to clonal 
expansion and leukemogenesis hamper their application. The host cell-encoded lens epithelium-derived growth factor (LEDGF/ 
p75) binds lentiviral integrase and targets integration to active transcription units. We demonstrated earlier that replacing the 
LEDGF/p75 chromatin interaction domain with an alternative DNA-binding protein could retarget integration. Here, we show that 
transient expression of the chimeric protein using mRNA electroporation efficiently redirects lentiviral vector (LV) integration in 
wild-type (WT) cells. We then employed this technology in a model for X-linked chronic granulomatous disease (X-CGD) using 
myelomonocytic PLB-985 gp91~^~ cells. Following electroporation with mRNA encoding the LEDGF-chimera, the cells were 
treated with a therapeutic lentivector encoding gp91''^''\ Integration site analysis revealed retargeted integration away from 
genes and towards heterochromatin-binding protein ip (CBXI)-binding sites, in regions enriched in marks associated with 
gene silencing. Nevertheless, gpQI''^'''' expression was stable for at least 6 months after electroporation and NADPH-oxidase 
activity was restored to normal levels as determined by superoxide production. Together, these data provide proof-of-principle 
that transient expression of engineered LEDGF-chimera can retarget lentivector integration and rescues the disease phenotype 
in a cell model, opening perspectives for safer gene therapy. 
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Introduction 

Over the past two decades, gene therapy using integrat- 
ing viral vectors has made incredible progress for the cor- 
rection of nnonogenic disorders. However, gene transfer has 
also been associated with some cases of vector-induced 
leukemiaJ-2 Initial clinical trials using Moloney murine leuke- 
mia virus-based retroviral vectors were prone to insertional 
mutagenesis because of their integration site preferences 
close to gene promoters and near known proto-oncogenes 
leading to enhancer-mediated expression. Insertional muta- 
genesis, or genotoxicity, is a major drawback for the use of 
retroviral vectors for gene therapy. 

The first case was reported in a French gene therapy trial 
for X-linked severe combined immunodeficiency where Molo- 
ney murine leukemia virus-based retroviral vectors were used 
to express IL2Ry in CD34+ bone marrow progenitor cells. ^ In 
total, 5 out of 20 treated patients developed T-cell leukemia 
due to integration in the proximity of proto-oncogenes, like 
LIM domain only 2 {LM02)?'^ In addition to this malignant 
integration event, other acquired mutations played an impor- 
tant role in leukemia development including gain-of-function 
mutation in NOTCH1, deletion of the tumor suppressor gene 
locus cyclin-dependent kinase 2A (CDKN2A), and transloca- 
tion of the TCR-p (T-cell receptor) region to the STIL-TAL1 
(STIL, stem cell leukemia gene/TAL-1 interrupting locus; TAL, 
T-cell acute lymphocytic leukemia) locus.^ Unfortunately, 



genotoxicity was not confined to X-linked severe combined 
immunodeficiency. Recent reports of insertional mutagenesis 
leading to myelodysplastic syndrome in a trial for X-linked 
chronic granulomatous disease (X-CGD)^ and a case of leu- 
kemia in a trial for Wiskott-Aldrich syndrome,^ underscored 
that this type of toxicity can be found in other disease settings, 
raising concerns regarding the safety of the current thera- 
pies. The cause of the adverse event in case of X-CGD was 
pinpointed to retroviral-mediated transcriptional activation of 
ecotropic viral integration site 1 (EVI1),^ and the same genes 
as found in X-linked severe combined immunodeficiency- and 
X-CGD-related genotoxicity, were found to be targeted in a 
gene therapy trial treating Wiskott-Aldrich syndrome. 

In that sense, lentivirus-based viral vectors were generally 
believed to be safer because of their favored integration into 
the body of transcription units, and because HIV infection is 
not associated with insertional activation of proto-oncogenes. 
However, lentiviral vectors (LVs) can still lead to insertional 
mutagenesis. A clonal expansion of erythroid cells was 
reported in a p-thalassaemia trial after lentiviral integration 
in the high mobility group adenine/thymine (AT)-hook 2 gene 
(HMGA2).^ Although the clonal expansion is still benign, it 
emphasizes that with these therapeutic benefits also come 
significant clinical risks and underscores the need to develop 
strategies to improve safety of LVs. 

Retroviral integration in the host genome is not a random 
process, but is specific for the retrovirus used: y-retroviral 
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Figure 1 Schematic representation of the LEDGF/p75 domain structure and of the CBX1-LEDGF325.53(j fusion, (a) LEDGF/p75 
carries a conserved PWWP-domain and several charged regions (CR) at its N-terminus. Together with the nuclear localization signal 
(NLS) and the AT hook-like domains (AT), these elements form the DNA-binding domain of LEDGF/p75. The C-terminal IBD domain is 
responsible for the interaction with HIV-IN. D366 is essential for the interaction with HIV-IN (indicated by an arrowhead): mutation of this 
residue to Asn (D366N) results in loss of interaction, (b) The CBXI-LEDGF325.530 and GBXI-LEDGF325.530 D366N fusions are depicted. All 
protein elements are drawn to scale. Numbers indicate amino acids. CBX1, heterochromatin protein 1p; IBD, integrase-binding domain; IN, 
integrase; LEDGF/p75, lens epithelium-derived growth factor; PWWP, Pro-Trp-Trp-Pro domain. 



vectors (like Moloney nnurine leukennia virus) preferentially 
integrate in pronnoter-proxinnal regions, thereby affecting 
pronnoter activity,^ whereas LVs tend to integrate in active 
transcription units, disfavoring pronnoter regions.^ Under- 
standing the mechanisms that dictate integration site selec- 
tion will increase our insight in retrovirology and help the 
development of gene therapy. For lentivirus-based vectors, 
the integration bias is triggered by the cellular cofactor lens 
epithelium-derived growth factor (LEDGF/p75).9 LEDGF/ 
p75, an epigenetic reader recognizing H3K36me3 marks 
on the chromatin through its N-terminal PWWP domain, 
binds downstream of the transcription start site of active 
transcription units. The ubiquitously expressed protein 
consists of an ensemble of chromatin-binding motifs includ- 
ing the PWWP domain, ^^'^2 AT-hook-like motifs, and three 
charged regions (CR1-3)^'^ (Figure 1a) at its N-terminal 
end. Through a protein-binding domain at the C-terminal 
end, called integrase (IN)-binding domain (aa 347-429),^^ 
LEDGF/p75 interacts with lentiviral integrases^^'^^ and binds 
to several cellular proteins including JP02,^^'^^ PogZ,^^ 
Cdc-7/ASK,2o and MLL/menin.^^ 

Artificial fusion proteins in which the LEDGF/p75 chromatin- 
binding domain is replaced with alternative chromatin interac- 
tion domains retarget integration towards regions bound by 
the respective chromatin-binding domain.^^-^"^ We previously 
engineered an artificial LEDGF/p75-based tether by fusing 
the C-terminal IN-binding fragment of LEDGF/p75 (LEDGFg^^. 
53q) to heterochromatin-binding protein 1p (CBX1), resulting in 
CBX1 -LEDGF325.530 (Figure 1b).22 CBX1 binds di- and trimeth- 
ylated H3K9^^ and locates to centromeric heterochromatin and 
transcriptionally silent regions, generally disfavored for lenti- 
viral integration. Expression of CBXI-LEDGF325.53Q in LEDGF/ 
p75-depleted HeLaP4 cells (referred to as LEDGF KD cells) 
retargeted LV integration towards regions enriched in CBX1- 
binding sites.^^ Although, this study provided proof-of-principle 
for functional retargeting of LV integration, the system required 
two supplemental viral vector transductions: one vector to 



deplete endogenous LEDGF/p75, and a second to express 
the retargeting CBXI-LEDGF325.530 tether. 

In an effort to develop safer gene therapy, we have evalu- 
ated mRNA electroporation to transiently overexpress CBX1- 
LEDGF325.530, both in LEDGF KD and in wild-type (WT) cells, 
and demonstrated retargeting of LV integration. In addition, we 
validated this technology in a cellular model for X-CGD. X-CGD 
is an immunodeficiency disorder, where a mutation in the gene- 
encoding gpQlP^^'^ affects phagocytic neutrophils by limiting the 
generation of reactive oxygen species which is essential for 
their microbicidal activity. The X-CGD model was generated by 
disrupting the gpQIP^^'^ gene using homologous recombination 
in myelomonocytic PLB-985 cells (X-CGD PLB-985).26 After 
electroporation of X-CGD PLB-985 cells with CBXI-LEDGF325. 
53Q mRNA, we transduced the cells with a therapeutic lentivector 
encoding a functional copy of gp91. Integration site analysis 
demonstrated retargeting and rescue of the disease phenotype 
was indicated by restoration of superoxide production. Analysis 
of single clones derived from these cells confirmed functional 
rescue. Together, these findings demonstrate that integration 
of gene therapy vectors can be retargeted towards gene-poor 
regions, where the genotoxicity may be reduced, without ham- 
pering functional rescue of the disease phenotype. 

Results 

mRNA electroporation results in efficient CBX1- 
LEDGF325.530 expression 

We previously reported that lentiviral integration can be retar- 
geted.22 Next to the therapeutic viral vector, this technology 
required two additional viral vector transductions: one for 
stable knockdown of the natural tether, LEDGF/p75, and a 
second for stable overexpression of the designed tether, e.g., 
CBX1 fused to the C-terminus of LEDGF/p75 (referred to as 
CBX1-LEDGF3,,.,3„). 

In a subsequent step towards safer gene therapy, we eval- 
uated mRNA electroporation to transiently express CBX1- 
LEDGF325.53Q. mRNA was produced from a modified pST1 
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Figure 2 Translation of electroporated mRNA in LEDGF/p75 KD HeLaP4 cells. Confocal imaging and immunocytochemistry were 
used to evaluate peak protein expression for the indicated constructs following mRNA electroporation of LEDGF-depleted HeLaP4 cells. 
Images were taken at 1 , 2, 3, and 4 days after electroporation. Left panels demonstrate expression of CBX1 -LEDGF325.530, CBX1 -LEDGPg^^. 
53qD366N, and LEDGF/p75 as detected with A300-848a antibody. Right panels demonstrate DAPI-stained nuclei. CBX1 , heterochromatin 
protein 1p; DAPI, 4',6-diamidino-2-phenylindole; LEDGF/p75, lens epithelium-derived growth factor. 



plasnnid tennplate^^ that pronnotes nnRNA stability and effi- 
cient translation. In parallel, the nnRNA for full-length LEDGF/ 
p75 and the interaction-deficient CBXI-LEDGF325.530D366N 
nnutant were generated as controls (Figure 1b). Residue 
D366 in LEDGF/p75 is pivotal for the interaction with IN, and 
substitution with Asn (D366N) abolishes the interaction. In 
addition, eGFP (enhanced green fluorescent protein) nnRNA 
was produced to spike the electroporation nnixture of the dif- 
ferent constructs to assess the electroporation efficiency. 

First, the different nnRNA constructs were electroporated in 
LEDGF KD HeLaP4 cells (referred to as LEDGF KD). Efficient 
nnRNA electroporation was confirnned by eGFP expression 
in 100% of the cells as observed by fluorescence-activated 
cell sorting analysis (data not shown). Expression of the 
different proteins was evaluated by western blot analysis 
(Supplementary Figure S1). LEDGF/p75 was absent in the 
LEDGF KD cell line,^^ whereas nnRNA electroporation resulted 
in protein bands nnigrating at the predicted nnolecular weight. In 
parallel, subcellular distribution was evaluated by innnnunocy- 
tochennistry (Figure 2). One day after nnRNA electroporation, 
CBX1-LEDGF3,,.,3„, CBX1-LEDGF3,,.,3„D366N, and LEDGF/ 
p75 were expressed (Figure 2).^^ We also evaluated the turn- 
over of the LEDGF325.53Q hybrids by nnonitoring protein expres- 
sion levels on 2, 3, and 4 days post-electroporation (Figure 2). 
All fusion proteins were abundantly present in the nuclei of the 
cells at 24 and 48 hours after electroporation. Expression lev- 
els decreased only fronn day 3 onwards, and were nearly conn- 
pletely absent at day 4 post-electroporation, corroborating the 
transient nature of expression after nnRNA electroporation. 

Transient expression of CBXI-LEDGF325.530 rescues LV 
integration in LEDGF KD cells and retargets integration 
away from genes 

After demonstrating efficient and transient translation of the 
electroporated mRNA, we investigated whether lentivec- 
tor transduction was supported. We transduced LEDGF KD 



HeLaP4 cells 24 hours after mRNA electroporation with a len- 
tiviral reporter vector (LV_eGFP-T2A-fLuc).^^ Integrated provi- 
ral vector copies were quantified by quantitative PGR (qPCR) 
(Supplementary Figure S2a). Electroporation of CBX1- 
LEDGF325.53Q and LEDGF/p75 mRNA significantly rescued len- 
tivector integration compared with mock-electroporated cells 
(1.7- and 1.8-fold more integrated copies, P= 0.0006 and P = 
0.0199, respectively), whereas the interaction-deficient CBX1- 
LEDGF325.53QD366N did not rescue integration. 

We next asked whether transient CBXI-LEDGF325.530 
expression retargeted integration to genomic sites bound 
by CBX1. Integration sites were amplified and analyzed as 
described previously.^^ Random control sites were gener- 
ated computationally, and matched to experimental sites 
with respect to the distance to the nearest Mse\ cleav- 
age site (matched random control (MRC)). In the analysis 
that follows, the distribution of experimental LV integration 
sites is normalized to that of the MRC sites, as a control 
for recovery bias due to cleavage by restriction enzymes. 
First, we evaluated LV integration frequencies following 
transient CBXI-LEDGF325.53Q expression in RefSeq tran- 
scription units (Supplementary Table S1). In line with pre- 
vious reports,22'2°'2^ integration in LEDGF KD cells occurred 
significantly less in transcription units (57.3% integration 
in RefSeq genes compared with 73.3% in WT cells (P < 
0.001)). Mock electroporation (no mRNA) did not alter inte- 
gration in RefSeq genes (58.3% compared with 57.3% in 
KD cells, P = 0.6452). Electroporation with the interaction- 
deficient CBXI-LEDGF325.530D366N mRNA altered inte- 
gration in RefSeq genes only slightly (62.6% compared 
with 58.3% in mock, P = 0.0261). Upon electroporation of 
LEDGF/p75 mRNA, integration in RefSeq genes was res- 
cued to WT levels (78.7% for LEDGF/p75). In line with previ- 
ous data, 22 upon CBXI-LEDGF325.53Q mRNA electroporation, 
integration was significantly disfavored in transcription units 
(39.9%, P = 0.8946 compared with MRC), consistent with 
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Figure 3 Transient expression of CBX1-LEDGF325.53(, retargets LV integration into CBX1-rich heterochromatin regions, (a) Western 
blot analysis showing protein expression of CBX1-LEDGF325.53q(D366N) and endogenous LEDGF/p75 (mock) compared with WT HeLaP4 
cells 48 hours after mRNA electroporation. Loading was controlled by a-tubulin. (b) Following mRNA electroporation, WT HeLaP4 cells were 
transduced with LV_eGFP-T2A-fLuc. Integrated vector copies were measured by quantitative PGR. (c) Correlations were made between 
integration sites and the density of a panel of 39 histone modifications. Associations of integration and histone methylation/acetylation 
were quantified using ROC areas, comparing the association of integration site data sets with the frequency in corresponding MRC sets. 
A ROC area scale is shown along the bottom of the panel. Tile color indicates whether a specific feature is favored (blue, enrichment 
relative to random) or disfavored (yellow, negative correlation compared with random) for integration (10 kb window) in the respective data 
sets relative to their MRCs. P values showing significance of departures from mock-treated WT cells (overlaid with dashes) are shown 
with asterisks (***p< 0.001, Wald statistics referred to distribution compared with mock). CBX1, heterochromatin protein 1p; LEDGF/ 
p75, lens epithelium-derived growth factor; LV, lentiviral vector; MRC, matched random control; ROC, receiver operating characteristic; WT, 
wild-type. 



the distribution pattern of GBX1 in heterochronnatic regions, 
which are generally gene-poor. In addition, we evaluated 
integration site distribution relative to histone modifications, 
ennploying high-resolution nnaps for the genome-wide dis- 
tribution of 39 histone modifications determined in human 
CD4+ T-cells.22'22 In accordance, integration was enriched 
near histone modifications bound by CBX1 (Supplemen- 
tary Figure S2b; H3K9me2 and H3K9me3), and near 
markers associated with transcriptionally silent regions or 
heterochromatin (H4K20me3 and H3K79me3), whereas 
the D366N control did not (Supplementary Figure S2b; 
compare CBX1 -LEDOFg^^ and CBX1 -LEDGFg^^ 530D366N 
with LEDGF KD). 



CBXI-LEDGF325.53Q mRNA electroporation retargets inte- 
gration to more favorable, intergenic regions in WT cells 

In a next step, we employed the mRNA electroporation 
strategy to retarget vector integration in WT HeLaP4 cells, 
expressing endogenous LEDGF/p75, using CBXI-LEDGF325. 

and CBXI-LEDGF325.530D366N mRNA. Fusion proteins 
were expressed (western blot analysis; Figure 3a) and LV 
integration frequency was comparable to mock (integrated 
proviral copies; Figure 3b). Analysis of integration site distri- 
bution revealed high frequency of integration in transcription 
units in WT cells and WT cells that were mock-electroporated 
(Table 1; 73.3 and 75.2% in RefSeq, respectively). Electropo- 
ration of WT cells with CBX1 -LEDGF,,,,,, mRNA resulted in 
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Table 1 Integration frequency near mapped genomic features in tlie genome in HeLaP4 cells 









Number 


%ln 


%<10kb 


% <50 kb 


% <250 kb 




Cell line 


mRNA electroporation 


of sites 


RefSeq 


TSS 


oncogene 


oncogene 


Experimental LV sites 


WT 




1,174 


73.3 


23.68 


12.95 


37.65 




WT 


Mock 


2,148 


75.2 


25.14 


14.01 


43.11 




WT 


CBX1-LEDGF3,„3„ 


2,788 


54.1*** 


17.25*** 


9.97*** 


33.68*** 




WT 


CBX1-LEDGF3,„3„D366N 


3,501 


73.6 


23.28 


12.94 


39.76 


MRC sites 


WT 




3,522 


39.2 


12.07 


4.68 


17.89 




WT 


Mock 


6,441 


39.5 


10.79 


4.72 


18.48 




WT 


CBX1-LEDGF3,,.,3„ 


8,361 


39.8 


11.76 


5.44 


19.18 




WT 


CBX1-LEDGF3,,.,3P366N 


10,503 


39.8 


11.47 


4.89 


18.75 



All datasets differed significantly from their respective MRCs (Fisher's exact test; P< 0.001). Significant deviation from mock in the Fisher's exact test is denoted 
by ***P< 0.001. 

Abbreviations: CBX1, heterochromatin protein 1(3; LEDGF, lens epithelium-derived growth factor; LV, lentiviral vector; MRC, matched random control; TSS, 
transcription start site; WT, wild-type. 



significantly less integration in transcription units (54.1%, P< 
0.001, compared with WT or mock), whereas expression of 
CBXI-LEDGF325.530D366N control did not affect distribution 
compared with WT (73.6%, P= 0.8869). In addition to integra- 
tion in RefSeq genes, we evaluated integration near genomic 
features that would indicate an increased likelihood of dereg- 
ulation of the neighboring genes, such as transcription start 
sites and oncogenes. Integration within 1 0 kb of the transcrip- 
tion start site of protein-coding genes and near the 5'-end of 
known proto-oncogenes or tumor suppressor genes (within 
50 or 250 kb distance) occurred significantly less frequently 
following CBX1 -LEDGF325.530 mRNA electroporation (Table 1 ; 
P < 0.001 when compared with WT or mock), whereas the 
CBX1-LEDGF^,,„^D366N mRNA control condition was 
indistinguishable from WT. More detailed analysis showed 
that integration associated with histone modifications bound 
by CBX1 (H4K20me3, H3K9me3, and H3K9me2; Figure 3c). 
In WT or mock-electroporated cells, LV integration frequency 
negatively associated with H3K9me2 and H3K9me3 histone 
modifications (yellow tiles) and positively correlated with his- 
tone modifications generally associated with active transcrip- 
tion, such as all acetylations and some histone methylations 
(blue tiles. Figure 3c). Transient CBX1-LEDGF325_53q expres- 
sion during vector transduction reversed most of these corre- 
lations (***P< 0.001 , Wald statistics referred to distribution 
compared with mock; Figure 3c), underscoring redistribu- 
tion of integration sites, shifting more towards random (less 
intense yellow and blue tiles). The pattern in CBX1 -LEDGF325. 
53qD366N control cells was not different from WT cells. Similar 
data were obtained in SupTI cells (data not shown). These 
results demonstrate that retargeting of viral vector integration 
employing transient CBXI-LEDGF325.53Q expression in WT 
cells is feasible. 

Retargeted integration of a therapeutic vector (LV_gp91) 
following mRNA electroporation rescues a model for 
X-CGD 

Subsequently, we validated our technology in human myel- 
omonocytic X-CGD PLB-985 cells (further referred to as 
X-CGD PLB),26 a cell model of X-CGD with a disrupted CYBB 
gene, the gene encoding the gp91P^°'' subunit of the phago- 
cyte NADPH-oxidase complex. WT PLB-985 cells were com- 
pared as a control (referred to as PLB WT). X-CGD PLB cells 



were electroporated with CBXI-LEDGF325.530 and CBX1- 
LEDGF325.530D366N mRNA, respectively, and transduced 
the next day with a LV encoding codon-optimized gp91P^°^ 
(LV_gp91 ),^^ or a control vector expressing eGFP (LV_eGFP) 
(Figure 4a). Western blot analysis for the respective LEDGF 
fusions showed bands at the expected molecular weight 
(data not shown). Transduction efficiency was determined 
by flow cytometry, monitoring gp91 expression (Figure 4b). 
Transduction of X-CGD PLB cells electroporated with CBX1- 
LEDGF325.530 or D366N control mRNA resulted in equal 
gp91 -positive cells as PLB WT (91.8 and 91.1% gp91^ cells, 
respectively, compared with 91.4% in PLB WT; Figure 4b) 
and comparable gp91 expression levels (1 .4 x 10^ and 1 .5 x 
10^ arbitrary units as measured by mean fluorescence inten- 
sity, respectively). In addition, comparable integrated vector 
copies were detected by qPCR in X-CGD PLB cells elec- 
troporated with CBXI-LEDGF325.530 or D366N control mRNA 
(3.6 and 3.15 vector copy numbers (VCN), respectively). 
Next, LV integration sites were determined in the respective 
cell lines for LV_gp91 and LV_eGFP (Table 2). In line with 
our previous results (Table 1), the vector integration in mock- 
treated X-CGD PLB cells preferentially occurred in RefSeq 
genes (Table 2, 80.65%) and was similar to CBXI-LEDGF325. 
53qD366N mRNA-electroporated cells (79.87 and 78.8% for 
LV_eGFP and LV_gp91, respectively), whereas integration 
shifted out of RefSeq transcription units in cells electropo- 
rated with CBXI-LEDGF325.530 mRNA (Table 2; 59.36 and 
56.22% for LV_eGFP and LV_gp91, respectively). Although 
integration shifted towards random and the reduction was 
significant (***P< 0.001 , for LV_eGFP and LV_gp91 compar- 
ing CBXI-LEDGF325.53Q mRNA electroporation to mock), inte- 
gration events were still favored in transcription units (***P < 
0.001 compared with MRC). Although CBXI-LEDGF325.530 
mRNA electroporation did not alter the integration frequency 
within 10 kb of the transcription start site of protein-coding 
genes in X-CGD PLB-985 cells, integration occurred signif- 
icantly less frequently within 50 or 250 kb distance of the 
5'-end of known proto-oncogenes or tumor suppressor genes 
(Table 2; P< 0.001 when compared with mock). Again, elec- 
troporation with the CBXI-LEDGF325.530D366N mRNA control 
did not affect integration near these features compared with 
mock. Likewise, integration occurred more frequently near 
markers for silenced chromatin (H3K9me2, H3K9me3, and 
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Figure 4 Vector constructs and expression efficiency after electroporation. After CBX1-LEDGF325.53o(D366N) mRNA electroporation, 
X-CGD PLB-985 cells were transduced with LV_gp91 or LV_eGFP and subsequently differentiated into granulocytes, (a) Schematic 
representation of transfer plasmids used for vector production. The SFFV promoter drives gp91 or eGFP expression. All lentiviral vectors 
are based on HIV-1^l4 3- (b) Flow cytometry was used to determine expression levels of gp91 (7D5 antibody) in electroporated cells. GBX1 , 
heterochromatin protein 1 p; eGFP, enhanced green fluorescent protein; gp91 , glycoprotein 91 ; LEDGF/p75, lens epithelium-derived growth 
factor; LTR, long terminal repeat; LV, lentiviral vector; SFFV, spleen focus-forming virus; SIN, self-inactivating; X-CGD, X-linked chronic 
granulomatous disease; \|/, packaging signal. 



Table 2 Integration frequency near mapped genomic features in X-CGD PLB cells 





Cell line 


mRNA electroporation 


Transduction 


Number of 
sites 


% In RefSeq 


%<10kb 
TSS 


% <50 kb 
oncogene 


% <250 kb 
oncogene 


Experimental LV sites 


X-CGD PLB 


Mock 


LV_eGFP 


2,103 


80.65 


25.06 


15.45 


43.32 




X-CGD PLB 


CBX1-LEDGF3,,.,3„ 


LV_eGFP 


1,095 


59.36*** 


23.01 


11.60** 


38.63** 








LV_gp91 


1,928 


56.22*** 


25.38 


11.68*** 


38.30** 




X-CGD PLB 


CBX1-LEDGF3,„3„D366N 


LV_eGFP 


2,221 


79.87 


25.98 


13.46 


43.18 








LV_gp91 


2,807 


78.8 


25.58 


14.46 


42.43 


MRC sites 


X-CGD PLB 


Mock 


LV_eGFP 


8,411 


40.77 


11.45 


5.10 


18.13 




X-CGD PLB 


CBX1-LEDGF3,3.^3„ 


LV_eGFP 


4,380 


40.50 


11.03 


4.66 


18.26 








LV_gp91 


7,708 


40.25 


11.53 


5.11 


17.54 




X-CGD PLB 


CBX1-LEDGF3,,.,3„D366N 


LV_eGFP 


8,880 


40.23 


11.11 


4.89 


18.39 








LV_gp91 


1 1 ,228 


39.72 


11.36 


4.57 


17.91 



All datasets differed significantly from their respective MRCs (Fisher's exact test; P< 0.001). Significant deviation from mock in the Fisher's exact test is denoted 
by **P< 0.01 ;***P< 0.001. 

Abbreviations: CBX1, heterochromatin protein 1(3; eGFP, enhanced green fluorescent protein; LEDGF, lens epithelium-derived growth factor; LV, lentiviral vector; 
MRC, matched random control; TSS, transcription start site; WT, wild-type. 
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Figure 5 Transient expression of CBXI-LEDGF325.530 retargets LV integration into CBX1-rich heterochromatin regions in X-CGD PLB 
cells. Correlations were made between integration sites and the density of a panel of 39 histone modifications (Figure 3c). Pvalues showing 
significance of departures from mock-treated WT cells (overlaid with dashes) are shown with asterisks (***p< 0.OOI , Wald statistics referred 
to distribution compared with mock). CBX1 , heterochromatin protein 1p; eGFR enhanced green fluorescent protein; gp91 , glycoprotein 91 ; 
LEDGF/p75, lens epithelium-derived growth factor; LV, lentiviral vector. 



H4K20nne3; ***P < 0.001 , Wald statistics referred to distri- 
bution connpared with nnock) and less in regions rich in nnark- 
ers for active chromatin following CBXI-LEDGF325.53Q mRNA 
electroporation (Figure 5; compare CBXI-LEDGF325.53Q and 
CBXI-LEDGF325.530D366N with mock) corroborating redistri- 
bution of integration sites out of transcription units and more 
random. 

Next, we verified whether the expression of the retargeted 
LV_gp91 was sufficient to reconstitute the superoxide pro- 
duction. Cells were treated with dimethyl sulfoxide to induce 
granulocytic differentiation. Differentiation was comparable for 
all cell lines as measured by flow cytometry (Supplementary 



Figure S3; upper quadrants, GD1 1 b+). Subsequently, NADPH- 
oxidase activity was assessed using the cytochrome C assay. 
Contrary to LV_eGFP cells, cells transduced with LV_gp91 
reached comparable superoxide levels as in PLB WT cells 
(about 2.8 nmol/min x 10^ cells; Figure 6a), irrespective 
of the mRNA used for electroporation, indicating that even 
though the therapeutic vector was retargeted, NADPH-oxi- 
dase activity was restored to WT levels. In parallel, we evalu- 
ated the potential of the different gp91 -reconstituted cells to 
reduce nitroblue tetrazolium (NBT) to NBT-formazan (blue 
dye). X-CGD PLB cells transduced with LV_gp91 reduced 
NBT as efficient as WT PLB cells, irrespective of retargeted 
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Figure 6 gp91 expression rescues superoxide production after retargeting to heterochromatin. Superoxide production was 
measured in the transduced and electroporated X-CGD PLB cells, (a) Superoxide production was measured by the continuous cytochrome 
C reduction assay, (b) Percentage of NBT formazan-positive cells are depicted after reduction of NBT as measure for superoxide production. 
Mean ± SD of three measurements are shown, (c) Monoclonals were grown from the LV_gp91 -transduced CBXI-LEDGFg^g.ggQCDSeSN)- 
electroporated X-CGD PLB cells. MFI of gp91 expression perVCN.The MFI determined from the different gp91 -expressing monoclonals 
was divided by the VCN of the respective monoclonal, (d) Superoxide production was measured by the continuous cytochrome C reduction 
assay for all different monoclonals and normalized to VCNs. CBX1 , heterochromatin protein 1 p; eGFP, enhanced green fluorescent protein; 
gp91, glycoprotein 91; LEDGF/p75, lens epithelium-derived growth factor; MFI, mean fluorescence intensity; NBT, nitroblue tetrazolium; 
VCN, vector copy number; WT, wild-type; X-CGD, X-linked chronic granulomatous disease. 



integration (86% for both electroporated cells connpared with 
81% in PLB WT cells, Figure 6b), whereas no reduction was 
observed in cells transduced with LV_eGFP control vector. 
Together, these results indicate that even when vector inte- 
gration is altered, retargeted out of transcription units, and 
towards regions enriched in heterochronnatin-associated 
nnarks, equal integrated VCNs are obtained and transgene 
expression is sufficient to rescue a disease phenotype. 

Alternatively, the observed rescue of superoxide produc- 
tion can be explained by a few WT integrations nnediated 
by endogenous LEDGF/p75. Single clones of LV_gp91 
X-CGD PLB cells were grown by linniting dilution (10 clones 
derived fronn cells electroporated with CBXI-LEDGF325.53Q 
nnRNA and 9 clones fronn cells electroporated with CBX1- 
LEDGF325.530D366N nnRNA). Individual clones were char- 
acterized by deternnining VCN, integration site distribution, 
gp91 expression levels, and production of reactive oxygen 
species. Integrated proviral copies were in the sanne range, 
with CBXI-LEDGF325.530 cells carrying 1-4 VCN and CBX1- 
LEDGF325.530D366N cells carrying 2-3 VCN (Supplementary 
Table S2). One CBXI-LEDGF325.530D366N clone did not 



carry an integrated provirus and thus was excluded fronn 
the analysis. gp91 expression levels were deternnined for 
cells electroporated either with CBXI-LEDGF325.53Q or with 
CBXI-LEDGF325.530D366N mRNA and normalized for VCN 
(Figure 6c). Although gp91 expression in both groups was not 
significantly different (P= 0.0565; unpaired two-sided f-test), 
nnore variance of gene expression levels was observed in 
clones obtained after electroporation with CBXI-LEDGF325. 
53Q nnRNA when connpared with CBXI-LEDGF325.530D366N. 
Individual clones were cultured in the presence of dinnethyl 
sulfoxide to induce nnyeloid differentiation and to score their 
capacity to produce superoxide and to rescue the X-CGD 
phenotype using the cytochronne C assay (Figure 6d). LV_ 
gp91 -transduced cells derived fronn CBXI-LEDGF325.53Q and 
CBX1 -LEDGF325.530D366N mRNA-electroporated X-CGD 
PLB cells produced equal levels of superoxide (P= 0.3674), 
even though reactive oxygen species production levels dif- 
fered nnore in the cells generated using CBXI-LEDGF325.53Q 
nnRNA. In addition, we deternnined the integration sites of 
the LV_gp91 vectors in the single clones, resulting in 18 and 
14 unique sites for CBXI-LEDGF325.530 and CBXI-LEDGF325. 
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53qD366N mRNA, respectively (Supplementary Table S3). 

Analysis of integration site distribution underscored retargeted 
integration in clones electroporated with CBXI-LEDGF325. 

mRNA: 66.67% integration in RefSeq genes for clones 
grown following CBXI-LEDGF325.53Q mRNA electroporation 
compared with 78.57% for CBXI-LEDGF325.530D366N clones. 
Also, integration in the CBXI-LEDGF325.53Q clones occurred 
more frequently near H4K20me3, H3K9me3, and H3K9me2 
histone modifications that are recognized by GBX1 (data not 
shown). 

These results indicate that even when integration is retar- 
geted and takes place in heterochromatin, transgene expres- 
sion is sufficient to rescue a disease phenotype regardless of 
integration site placement and expression levels. 

Discussion 

Clinical trials to treat several blood cell diseases have dem- 
onstrated that long-term gene correction is feasible and in 
some patients may provide equal clinical benefit with less 
risk as compared with standard treatments. Alongside, the 
therapeutic benefits has also come with the recognition of 
associated clinical risks. Initial clinical trials using y-retroviral 
vectors were prone to insertional mutagenesis, in part due 
to their integration site preferences close to gene promoters 
and near proto-oncogenes leading to enhancer-mediated 
expression. Accordingly, LVs were generally believed to be 
safer because of their favored integration into the body of 
active transcription units. However, LV integration can lead to 
insertional mutagenesis and clonal expansion, as reported 
in the p-thalassaemia trial, ^ which underscores the need to 
develop strategies that improve the safety of viral vectors. 

The increasingly sophisticated understanding of the mech- 
anisms by which the therapeutic vectors caused leukemia, 
provides new avenues to develop vectors that maintain or 
even increase the efficacy seen in the first-generation retro- 
viral vectors but that potentially offer a safer alternative. Sev- 
eral approaches have been pursued to overcome the issues 
accompanying the current therapies: insulator elements have 
been introduced to act as both enhancer blockers and bound- 
aries against potential silencing of integrating viral vectors;^^ 
genetic regions have been selected that might function as 
safe harbors for integration;^^ alternative integration systems 
with a more random integration profile have been developed, 
such as foamy virus or a-retroviral vectors;^^'^^ progress 
has been made in transposon-based gene transfer;^^ spe- 
cific promoters have been selected to support cell-specific 
expression^'^° and to limit unwanted transgene expression in 
off-target cells;"^^ and alternative tethers have been generated 
that redirect integration towards defined genetic regions.^^-^^ 

We previously reported retargeting of lentiviral integra- 
tion in LEDGF/p75 KD HeLaP4 cells stably overexpressing 
CBXI-LEDGF325.53Q. That system required two extra vectors: 
one to create the LEDGF/p75 KD and another to express 
CBX1-LEDGF,,,„,. With the cellular function of LEDGF/ 
p75 remaining largely unknown, modification of LEDGF/p75 
expression levels (stable knockdown or overexpression of 
artificial chimera) is likely to interfere with the natural func- 
tion of LEDGF/p75. Indeed, akin to its effect on HIV-1 IN, 
LEDGF/p75 orchestrates the chromatin association of several 



proteins or protein complexes, such as JP02,^^ pogZ,^^ MLI_/ 
menin,^^ and Cdc7-ASK.2° Overexpression of the IN-binding 
C-terminal end of the LEDGF/p75 protein might affect these 
interactions and thus their downstream pathways. In a next 
step towards safer gene therapy, we therefore opted for tran- 
sient overexpression of CBXI-LEDGF325.53Q using mRNA 
electroporation and proved that transient expression is suf- 
ficient to redirect integration towards regions enriched in 
CBX1 -binding sites, both in LEDGF/p75 KD (Supplementary 
Table S1 and Supplementary Figure S2) and in WT (Table 1 
and Figure 3) cells. 

Moreover, we reported retargeting of LV_gp91 in GBX1- 
LEDGF325.530 mRNA-electroporated X-CGD PLB-985 cells, 
a cell model for X-CGD (Table 2 and Figure 5) and dem- 
onstrated functional rescue of the disease phenotype after 
retargeting to CBX1 -binding sites (Figure 6). Although a 
clear shift in integration out of RefSeq genes was detected 
(80.65 and 56.22% for mock and CBXI-LEDGF325.530 mRNA- 
electroporated cells, respectively), the frequency in the 
presence of the engineered was still different from random 
(40.24% for MRC).This can be explained by residual integra- 
tion targeted by endogenous LEDGF/p75. To overcome this, 
CBXI-LEDGF325.53Q mRNA electroporation may be combined 
with electroporation of small interfering RNA targeting the 
endogenous LEDGF/p75 mRNA. 

One question that remained was whether the functional 
rescue observed in the cytochrome C and the NBT assay 
(Figure 6a,b) originated from integrations in hetero- or 
euchromatin, and whether the integration event was medi- 
ated by endogenous LEDGF/p75 or CBXI-LEDGF325.530. To 
address this issue, we generated single clones and evalu- 
ated phenotypic rescue. Our results demonstrated that inte- 
grated VCNs and gp91 expression levels were comparable for 
X-CGD PLB-985 cells electroporated with CBXI-LEDGF325 
530 and CBXI-LEDGF325.530 D366N mRNA (Figures 4 and 
6c). The fact that these measurements were performed on 
cells that were cultured for at least 6 months underscores 
that, even with retargeted integration (out of transcription 
units and towards regions enriched in heterochromatin-as- 
sociated marks), expression levels are sufficient to rescue 
the disease phenotype and are not more silenced over time 
than with WT integration. However, the increased variation 
in gene expression per VCN in single clones obtained after 
CBXI-LEDGF325.530 mRNA electroporation may reflect the 
impact of integration near heterochromatin markers on vec- 
tor gene expression, also referred to as position effect var- 
iegation. Indeed, the spleen focus-forming virus promoter is 
reported to be prone to silencing on the long-term. ^ Although 
this does not lead to significantly lower reactive oxygen spe- 
cies production in our setting (gp91 is not rate-limiting at high 
expression levels),^^ this effect may become more relevant 
when weaker promoters are used. One strategy to overcome 
this issue is the inclusion of a ubiquitous acting chromatin- 
opening element into the LV, an element that has been shown 
to reduce position effect variegation and to boost expression 
levels.^"^ Alternatively, insulator sequences could be included 
in the long terminal repeat to achieve a similar effect.'^^ 

Electroporation-mediated mRNA transfection is an efficient 
approach for gene expression that does not result in perma- 
nent genetic modification of cells. Successful gene transfer by 
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mRNA electroporation into primary T lymphocytes has been 
demonstrated forT-cell receptor^^ and chimeric antigen recep- 
tors.^^ Likewise, mRNA-mediated gene delivery into human 
CD34+ progenitor cells promoted efficient protein expression, 
allowing transitory manipulation of the stem cell characteris- 
tics."^^ In a next step towards safer vectors for gene therapy, 
targeting integration to a specific site within the genome, pref- 
erably unique and safe, will be advantageous. Two candidate 
loci are the CCR5 and adeno-associated virus (AAV) integra- 
tion site 1 loci. The CCR5 locus is not essential in humans, 
because individuals carrying a homozygous null mutation 
develop normally and are healthy; AAV integration site 1 is a 
common integration site of the human non-pathogenic AAV.^^ 
Therefore, fusions of LEDGF325.53Q linked to artificial zinc fin- 
gers or transcription activator- 1 ike effectors that specifically 
bind the CCR5 or AAV integration site 1 locus could be gener- 
ated to direct integration towards these safe harbors. 

Our findings open possibilities to engineer viral vectors that 
incorporate LEDGF hybrids to target integration into safe-land- 
ing sites, thereby reducing the risk of insertional mutagenesis. 
Hare and colleagues have reported^^ a set of amino acid sub- 
stitutions in HIV-IN that abolish LEDGF/p75 binding, together 
with mutations in the LEDGF/p75 protein that restore binding. 
Gene delivery vectors could thus use an altered IN/LEDGF 
pair to direct integration, even in the presence of WT LEDGF/ 
p75. To date, the altered IN does not show WT integration 
activity but this may be improved with further engineering.^^ 

In conclusion, we demonstrate that integration can be 
retargeted in the presence of endogenous LEDGF/p75 fol- 
lowing transient expression of CBXI-LEDGF325.53Q, and that 
expression after retargeting remains efficient and can rescue 
the X-CGD disease phenotype in a cell culture model for the 
disease. 

Materials and methods 

Cell culture. SupT1 cells (NIH AIDS Research and Reference 
Reagent Program, http://www.aidsreagent.org/) and X-CGD 
PLB-985 cells, derived from the PLB-985 myelomonocytic 
leukemia cell line by disrupting the gp91P^°^ gene by homolo- 
gous recombination,^^ were grown in RMPI (Gibco, Invitrogen, 
Merelbeke, Belgium) supplemented with 10% heat-inacti- 
vated fetal calf serum (Sigma-Aldrich, Bornem, Belgium) and 
50 [ag/ml gentamicin (Gibco). HEK 293T cells (ATCC, LGC 
Standards S.a.r.l., Molsheim Cedex, France) were grown in 
Dulbecco's modified Eagle's medium with Glutamax (Gibco) 
supplemented with 8% fetal calf serum and 50 ng/ml gentam- 
icin. HeLaP4 cells, a kind gift from Pierre Charneau, Institut 
Pasteur, Paris, France, were grown in Dulbecco's modified 
Eagle's medium with Glutamax supplemented with 5% fetal 
calf serum, 50 ng/ml gentamicin, and 0.5 mg/ml geneticin 
(Gibco). All cells were grown at 37 °C in a humidified atmo- 
sphere containing 5% CO2. Stable monoclonal LEDGF/p75 
KD HeLaP4 cells were generated previously. 

Plasmids and in vitro RNA synthesis. All primers used are 
listed in Supplementary Table S4. All enzymes used were 
obtained from Fermentas (St Leon-Rot, Germany). 

Three plasmids were generated for in vitro RNA synthesis 
using pST1_eGFP27 as backbone plasmid: pST1_CBX1- 



LEDGF3,3.33,BC, pST1_CBX1-LEDGF3,3.33,BC-D366N, and 
pST1_LEDGF/p75BC. BC (backcomplemented) indicates 
that the mRNA encoded by these constructs is resistant 
to the miRNA-based hairpin expressed in the LEDGF KD 
cells. The CBX1-LEDGF3,,.33„BC and CBX1-LEDGF3,,.33„. 
BC-D366N cassettes were PCR-amplified with the primers 
CBX1_s Nhel and LEDGF325 53o_as Sail, using the templates 
pLNC-CBX1 -LEDGF325.53oBC-IRES-BsdR22 and pLNC- 
CBX1-LEDGF325.53oBC-D366N-IRES-BsdR,22 respectively. 
These fragments were digested with Nhe\ and Sa/I, and sub- 
sequently cloned into pST1_eGFP that was digested with 
Spel and Xho\ to remove eGFP The pST1_LEDGF/p75BC 
construct was generated similarly by PCR amplification with 
the primers LEDGF_s Nhel and LEDGF325.53o_as Sail, and 
cloning it into the digested pST1_eGFP backbone. 

The mMESSAGE mMACHINE T7 Kit (Applied Biosystems, 
Gent, Belgium) was used to generate 5'-capped and 3'-polyA 
containing mRNA transcripts, starting from 5 \iq linearized 
plasmid containing a T7 promoter upstream of the target 
open-reading frame. Following linearization of pST1_eGFP, 
pST1_CBX1-LEDGF325.53oBC, pST1_CBX1-LEDGF325.53oBC- 
D366N, and pST1_LEDGF/p75BC with Lgu\, mRNA was syn- 
thesized according to the manufacturer's protocol, resulting in 
four mRNA preparations, encoding eGFP, CBXI-LEDGF32553Q, 
CBXI-LEDGF325.530 D366N, and LEDGF/p75, respectively 

mRNA electroporation. In a first step, protein expression levels 
following eGFP mRNA electroporation were scored, compar- 
ing square wave and exponential decay pulse in different cell 
lines (HeLa, SupT1, PLB-985). Four million cells in Optimem 
without phenol red (Gibco) were mixed with mRNA encod- 
ing eGFP (5 fag). This cell suspension was then transferred 
to a Gene Pulser/Micropulser Electroporation Cuvette with a 
0.4 cm gap (BioRad, Nazareth, Belgium) and subjected to a 
square wave pulse with 500 V and pulse length 5 ms or an 
exponential decay pulse starting at 450 V by a capacitance 
of 150 \iF, using the Gene Pulser Xcell electroporation sys- 
tem (BioRad). In all cell lines tested, the square wave pulse 
resulted in higher expression levels than exponential decay 
pulse (data not shown). Hence, all mRNA electroporations in 
this study were performed using square wave pulse. 

Four million cells in Optimem without phenol red (Gibco) 
were mixed with mRNA encoding eGFP (1 \ig) and CBX1- 
LEDGF3,3.33, (5 ^g) or CBX1-LEDGF3,3.33,D366N (5 ^g) or 
LEDGF/p75 (5 fag) before electroporation with the square 
wave pulse. After electroporation, cells were seeded in the 
appropriate growth medium. 

Plasmids and LV production. LV production was performed 
as described earlier.^^ Briefly, vesicular stomatitis virus gly- 
coprotein pseudotyped HIV-based particles were produced 
by poly(ethylenimine) transfection using pCHMWS_eGFP- 
T2A-fLuc,29 SgpW,34 ^nd SEW^^ as a transfer plasmid for 
LV_eGFP-T2A-fLuc, LV_gp91, and LV_eGFP, respectively. 

Transduction. Electroporated HeLaP4 cells were seeded 
in a 96-well plate 1 day before transduction (20,000 cells/ 
well). All electroporated SupT1 cells and X-CGD PLB-985 
were seeded in a 24-well plate for 24 hours, then reseeded 
in a 96-well plate at 40,000 cells/well and immediately trans- 
duced. After 72 hours, 90% of cells were reseeded into two 
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plates (fluorescence-activated cell sorting analysis and Luc 
assay). The remainder was cultured for qPCR or integration 
site analysis for at least 20 days to eliminate non-integrated 
DNA. 

Expression analysis. Immunocytochemistry and western blot 
analysis were performed as previously described.^^ Briefly, 
LEDGF325.53Q fusions were detected using A300-848a anti- 
body (1/500 dilution; Bethyl Laboratories, Montgomery, TX) 
and Alexa555-labeled goat anti-rabbit secondary antibodies 
(Life Technologies, Ghent, Belgium). All images were acquired 
using a laser scanning microscopy 510 META imaging unit 
(Carl Zeiss, Zaventem, Belgium). Alexa555 was excited at 543 
nm (HeNe laser), and 4',6-diamidino-2-phenylindole (DAPI) at 
790 nm (MaiTai laser; Spectra Physics, Mountain View, CA). 
For western blot analysis, SDS (1%) protein extracts were 
separated by gel electrophoresis. LEDGF/p75 fusions were 
detected using A300-848a antibody (1:2,000; Bethyl Labo- 
ratories) and visualized by chemiluminescence (ECL^'^^; GE 
Healthcare, Diegem, Belgium). Equal loading was confirmed 
with a-tubulin antibody (1:4,000, T-4026; Sigma). 

qPCR. Integrated proviral copies were quantified by real-time 
qPCR on an iQ5 Multicolor RT PGR detection system (Bio- 
Rad) as reported earlier.^^ Briefly, cells transduced with HIV- 
based LVs were cultured for at least 2 weeks to eliminate all 
non-integrated DNA. Genomic DNA was extracted using the 
GenElute Mammalian genomic DNA miniprep kit (Sigma). 
Samples corresponding to 100 ng of genomic DNA were 
used as input for qPCR. Integrated copies for HIV-based LVs 
were measured using a Gag-derived primer-probe set: RT- 
GAG-1, 5'-ATC AAG GAG CCATGC AAATGTT; RT-GAG-2, 
5'-CTG AAG GGTACTAGTAGTTCCTGCTATGTC; RT-GAG 
probe, 5'-FAM-ACC ATC AAT GAG GAA GOT GCA GAA 
TGG GA-Tamra-3'. RNaseP was used as an endogenous 
housekeeping control (TaqMan RNaseP control reagent; 
Life Technologies). All samples were run in quadruplet and 
subjected to an initial 3 minutes denaturation at 95 °C fol- 
lowed by 50 cycles of 1 0 seconds at 95 °C and 30 seconds 
at 55 °C. Data were analyzed with iQ5 Optical System Soft- 
ware (BioRad). Relative numbers of integrated LV copies 
were calculated by normalizing the Gag-signal over RNaseP 
signal. 

The mean VON per cell was determined by real-time qPGR 
using a Roche LightGycler480 (Roche, Mannheim, Germany) 
applying advanced quantification (LightCycler 480 Software 
1 .5.0; Roche) with a reference sample known to harbor one 
single vector integrant in serial dilution. Samples were mea- 
sured in triplicates using 50-100 ng of genomic DNA mixed 
with Roche LC480 Probes Master Mix (Roche). Primers and 
probe specific for the codon-optimized gp9lP^°'' coding region 
and the human EpoR as endogenous standard were used 
to determine the amount of viral sequences per genome in 
a duplex reaction. A serial dilution of genomic DNA from a 
known PLB-985 clone harboring a single lentiviral provirus 
was used for quantification of VCNs. Primers and probe for 
hEpo-R: forward 5'-ATG CCA GAG TAG ACC GAG AC and 
reverse 5'-GGA AAG GAA CTA ACA AAG GGA C, probe 
sequence 5'-TCT TGG GGA CTT TCA CCT GAT TTT CCT 
TCT AC. Primers and probe for codon-optimized gp91 : forward 



5'-CCA GCA GCA CCA AG A CCA T and reverse 5'-CCG ATG 
AAA AAG ATC ACG AAC AG, probe sequence 5'-ACC AGA 
ACA CCT GGA AGT AGC TCC GCC (designed by Primer- 
Design, Southampton, UK). 

Luciferase activity assay Cells were lysed with 70 of lysis 
buffer (50 mmol/l Tris, pH 7.5; 200 mmol/l NaCI; 0.2% NP40; 
1 0% glycerol). The lysate was assayed according to the man- 
ufacturer's protocol (ONE-Glo; Promega, Leiden, The Neth- 
erlands). Luciferase activity was normalized for total protein 
(BCA; Pierce, Aalst, Belgium). 

Integration site amplification. Integration sites were ampli- 
fied as previously described.^^ Genomic DNA was digested 
using Mse\, and linkers were ligated. Provirus-host junctions 
were amplified by nested PGR using bar-coded primers. 
This enabled pooling of PGR products into one sequenc- 
ing reaction. Products were gel-purified and sequenced on 
a 454 GS-FLX instrument (454 Life Sciences; Roche, Basel, 
Switzerland). 

Bioinformatic analysis. For integration sites to be authentic, 
sequences needed a best unique hit when aligned to the 
human genome (hg18) using BLAT, the alignment began 
within 3 bp of the viral long terminal repeat end, and had 
>98% sequence identity. Statistical methods are detailed in 
Berry et al."^^ Random control sites were generated compu- 
tationally, and matched to experimental sites with respect to 
the distance to the nearest Mse\ cleavage site (MRC). In all 
analyses, the distribution of experimental LV integration sites 
is normalized to that of the MRC sites, as a control for recov- 
ery bias due to cleavage by restriction enzymes. Integration 
site counts were compared with MRCs by a Fisher's exact 
test (where stated), or by multiple regression models for inte- 
gration intensity and a c-logit test for significance.^^ Analysis 
was carried out using R (http://www.r-project.org). Frequency 
of integration sites near oncogenes or within tumor suppres- 
sor genes was calculated using a list of genes that were col- 
lected to generate a comprehensive list of cancer-related 
genes (http://www.bushmanlab.org/links/genelists). Epige- 
netic heat maps show the relationship between integration 
sites and the density of 39 histone modifications are sum- 
marized using receiver operating characteristic area. Histone 
modification data determined in primary T cells from Barski et 
al.^^ and Wang et al.^^ were used. Tile color indicates whether 
a chosen feature is favored (blue) or disfavored (yellow) for 
integration in the respective data sets relative to their MRCs. 
Asterisks indicate significant P values when comparing all 
samples (columns) to mock-treated WT sample (overlaid 
with dashes). Detailed information on the statistics and the 
interpretation of the heat maps can be found at doi:10.1371/ 
journal. ppat. 1 001 31 3.s01 1 . 

CBX1 -binding sites were analyzed using data from Vogel 
et al.^^ The number of sequence tags from the ChlP-Solexa 
data sets in a defined window around each LV integration site 
or MRC, was calculated. For each DamID probe set avail- 
able, probes were aligned onto the hg18 using BLAT, and 
their associated log2-binding ratios used to select the top 5% 
of sites. For each integration site or MRC, the average num- 
ber of high-affinity probes within a defined window around the 
site was calculated. 
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Fluorescence-activated cell sorting analysis. gp9lP^°^ was 
detected using the 7D5-FITC antibody (MBL international no. 
D162-4). Granulocytic differentiation was detected using the 
CD11b-PE-Cy7 (clone M1/70) and dead cells were excluded 
using Dead Cell Stain eFluor780 (both from eBiosciences, 
Frankfurt, Germany). All data acquisitions were performed on a 
BD FACSCanto II flow cytometer (BD Biosciences, Heidelberg, 
Germany) and analyzed using the BD FACSDiva 6.0 software. 

Cytochrome C and NBT assay Cytochrome C assays were 
carried out according to Mayo and Curnutte^° using a Spec- 
tra MAX 340 reader (Molecular Devices, Sunnyvale, CA) and 
the SOFTmax Version 2.02 PRO software. The NBT assay 
was performed as described earlier.^^ Briefly, electroporated 
X-CGD PLB-985 cells differentiated to neutrophils were used 
in 96-well plates. To the sample and control tubes, 200 \x\ of 
NBT reagent (1 ng/ml of NBT dye (Sigma) in 0.9% NaCI) were 
added in the presence or absence of 1 ng/ml PMA (phorbol 
12-myristate 13-acetate) (Sigma). Cells were incubated at 
37 °C for 30 minutes. Neutrophils containing the blue/black 
formazan deposits, indicative of superoxide production, were 
classed as NBT-positive and quantified using light micros- 
copy with oil immersion. 

Supplementary material 

Figure S1. Translation of electroporated mRNA in LEDGF 
KD cells. 

Figure S2. Transient expression of CBXI-LEDGF325.53Q in 
LEDGF KD cells rescues LV transduction and retargets LV 
integration into CBX1-rich heterochromatin regions. 
Figure S3. Differentiation status of electroporated X-CGD 
PLB cells. 

Table S1. Integration frequency near mapped genomic 

features in LEDGF KD HeLaP4 cells. 

Table S2. Characterization of single clones. 

Table S3. Vector integration sites in single X-CGD PLB 

clones. 

Table S4. Primers used for cloning. 
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